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A METHOD AND APPARATUS FOR THROTTLING AND DISTRIBUTING 
DATA TRANSMISSIONS ACROSS A NETWORK 



5 BACKGROUND OF THE INVENTION 

1. Field of Invention 

The present invention is directed to a method and apparatus for throttling and 
distributing data transmissions across a network. 

2. Description of Related Art 

10 Presently, terminals, such as service switching points, send requests to servers, 

such as Service Control Points. These requests represent message exchanges between the 
involved terminals and servers. A request begins with an initial query message being sent 
by the terminal to the server. After processing the initial query message, the server may 
send a response message back to the terminal. Depending on the circumstances, the 

1 5 terminal may then send a subsequent query(s) to the server requesting further processing 
which may in turn result in subsequent responses being sent back. The request ends when 
the last pair of request/optional response is processed. 

Occasionally, a server may become overloaded when terminals send requests 
faster than the server can process the requests. As a result, the overload may cause 

20 considerable delay in servicing any subsequent requests. This problem can occur more 
frequently when the terminals have higher processing capabilities than the servers. The 
problem is also common when terminals send requests in a system having a set of 
heterogeneous servers (i.e., servers with different processing capabilities from each 
other). 

25 SUMMARY OF THE INVENTION 

The present invention provides a throttling unit for throttling and distributing data 
transmissions between terminals and servers in a communication system. The throttling 
unit determines a current status of the servers in the system by querying each of the 
servers at a predetermined interval. Alternatively, the servers may independently report 

30 their status to the throttling unit. The current status of the servers can further include 
receipt of an overload notification from a server indicating any or all of the servers are 
overloaded (i.e., demand for a servers resources exceeds the server's resources). 
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Once the throttling unit has determined the current status of the servers, the 
throttling unit can adjust transmission rates of terminals to control the amount of data 
received by each of the servers, based on the corresponding current status of each of the 
servers. The transmission rate can also include a local load coefficient which can be 
adjusted to either increase or decrease the aggregate transmission rate of a terminal to the 
servers, such increase or decrease can depend on the status of the servers. 

A data transmission can subsequently be sent from a terminal to the servers based 
on the transmission rate determined for each of the servers. Sending a data transmission 
may further include blocking a portion of the data transmissions based on a current value 
of a local load coefficient. 

The present invention can further improve server delay characteristics, reduce the 
probability of server overload, and improve aggregate server throughout. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The preferred embodiments of the present invention will be described with 
reference to the following figures, wherein like numerals designate like elements, and 
wherein: 

Fig. 1 is an exemplary block diagram of a system according to the present 
invention; 

Fig. 2 is an exemplary block diagram of a throttling unit of Fig. 1; 

Figs. 3a-3j are exemplary data structures for storing information corresponding to 
servers in the memory of Fig. 2; 

Fig. 4 is a flowchart outlining an exemplary operation of a server according to the 
present invention; 

Fig. 5 is a flowchart outlining an exemplary operation of a client according to the 
present invention; 

Fig. 6 is a flowchart outlining an exemplary operation of a terminal according to 
the present invention; and 

Fig. 7 is a flowchart outlining another exemplary operation of a terminal 
according to the present invention. 
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DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 

Fig. 1 is an exemplary block diagram of a throttling and distributing system 100 
according to the present invention. The system 100 includes a network 101 having a 
plurality of terminals 102-106 and servers 108-1 12 in communication with the network 
101 via communication links 118. The system further includes a throttling unit 1 14 
connected with the network 101 via a communication link 118. The throttling unit 1 14 
may perform network monitoring and control functions of the network 101. 
Additionally, the throttling unit 114 can monitor and control data transmissions between 
the terminals 102-106 and the servers 108-112. 

The network 101 may be a single network or a plurality of networks of the same 
or different types. For example, the network 101 may include a local telephone network 
(such as a Bell Atlantic network) in connection with a long distance network (such as an 
AT&T long distance network). Furthermore, the network 101 may be a data network or a 
telecommunication network in connection with the data network. Any combination of 
telecommunication and data networks may be used without departing from the spirit and 
scope of the present invention. For the purposes of discussion, it will be assumed that the 
network 101 is a single data network. 

Terminals 102-106, servers 108-1 12, and throttling unit 1 14 are in communication 
with network 101 over communication links 118. These communication links 118 may 
be any type of connection that allows for the transmission of information. Some 
examples include conventional telephone lines, digital transmission facilities, fiber optic 
lines, direct serial/parallel connections, cellular telephone connections, satellite 
communication links, local area networks (LANs), Intranets and the like. 

Terminals 102-106 can be devices of any type that allow for the transmission 
and/or reception of communication signals. For example, terminals 102-106 can include 
CPEs, CPE modems, Digital Subscriber Line (DSL) equipment, DSL systems, cable 
modems, set top boxes, telephones, wireless telephones, cellular telephones, Personal 
Digital Assistants (PDAs), computer terminals, database terminals, pagers, facsimile 
machines, answering machines and the like. The servers 108-1 12 can be devices of any 
type that can receive and respond to data transmissions received from the terminals 102- 
106. Furthermore, each server 108-1 12 can have a different processing capacity. 
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Prior to steady-state operation of the system 100, and periodically during 
operation, the throttling unit 1 14 can determine a current status of each of the servers 
108-1 12. The current status may be obtained by the throttling unit querying each of the 
servers 108-1 12 or having each of the servers 108-1 12 independently and periodically 
5 report their current status to the throttling unit 1 14. Regardless of the technique for 
acquiring the current status, the current status can represent a current capacity of the 
server 108-1 12 to process data transmissions from the terminals 102-106. 

For example, if a server 108 was functioning at 100% of its operational capacity, 
the current status of the server 108 may be "good". However, if the server 108 was 

10 temporarily operating at only 50% of its operational capacity, possibly because of repairs 
being done to the server 108, then the current status of the server 108 may only be "fair". 
Additionally, if the server 108 were altogether removed from service, due to possible 
technical problems, then the current status of the server 108 may be "poor". While the 
current status has been described as good, fair and poor, it is to be understood that the 

1 5 current status may also be described as a numerical value, such as a percentage. 

The current status of the servers 108-1 12 can include an indication as to whether 
any of the servers 108-1 12 is overloaded. An individual one of the servers 108-1 12 can 
be considered overloaded when a demand for that server's processing resources exceeds 
its available processing resources. An overloaded server can cause delay in responding to 

20 a data transmission from one of the terminals 102-106. Again, the throttling unit 1 14 

may obtain a server's overload status from queries sent to the individual servers 108-112 
or by the servers 108-1 12 periodically and independently transmitting overload status 
information to the throttling unit 114. 

In operation, the throttling unit 1 14 can monitor and control the data 

25 transmissions of the terminals 102-106 which are sent to the various servers 108-1 12. 
The control of the data transmissions from the terminals 102-106 can be based on the 
transmission rate which the throttling unit 1 14 determines based on the current status of 
each of the servers 108-1 12. The transmission rate is a rate at which a terminal 102-106 
is permitted to transmit data to a particular server 108-1 12. 

30 For example, assume that all of the terminals 102-108 are operating properly and 

capable of performing at 100% of their processing capacity. Furthermore, assume that 
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the servers 108-1 12 are all capable of processing data at a same rate. Accordingly, each 
of the three servers 108-1 12 would be equally capable of processing one-third of the total 
data transmissions received from terminals 102-106. The transmission rate for each of 
the server 108-1 12 would then be one-third (1/3), and the throttling unit 1 14 could then 
5 monitor and control the data transmission of the terminal 102-106 so that approximately 
one-third (1/3) of the total data transmissions of each of the three terminals 102-106 is 
distributively transmitted to each of the servers 108-1 12. Alternatively, the throttling unit 
1 14 can transmit the transmission rate corresponding to each of the servers 108-1 12 to the 
terminals 102-106 and the terminals 102-106 can then control their own data 

10 transmissions so that one-third (1/3) of their total data transmissions is transmitted to 
each of the servers 108-1 12. In this manner, a single server 108 does not become 
overworked while the other servers 110-112 remain idle. 

While the throttling unit 1 14 is shown as an independent unit coupled to the 
network 101, it can also be incorporated into the terminals 102-106, servers 108-112 

1 5 and/ or may be distributed throughout the network 101. For example, the throttling unit 
1 14 may be made part of the various central offices or servers (not shown) employed by 
the network 101 which are distributed throughout the network 101. Any configuration 
that permits monitoring and control of the network 101 can be used without departing 
from the spirit and scope of the present invention. 

20 Fig. 2 is an exemplary block diagram of the throttling unit 114. The throttling 

unit 1 14 may include a controller 200, a memory 202 and a network interface 204. The 
above components are coupled together through a control/signal bus 230. The 
controller 200 controls the operation of the throttling unit 1 14 and communicates with the 
network 101 through the network interface 204. 

25 As described above, periodically the controller 200 gathers the current status of 

each of the servers 108-1 12 via the network interface 204. The current status of each of 
the servers can then be stored in the memory 202. Based on the stored current status, the 
controller 200 can then control the data transmission of the terminals 102-106 to 
respective servers 108-112. 

30 Figs. 3a-3d are exemplary data structures 300 of the memory 202. The data 

structure 300 includes a server ID field 302 which contains a unique identifier of each of 
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the servers 108-1 12. As shown, the server IDs in field 302 can correspond to the 
numbering of the servers numbering 108-1 12 in Fig. 1. 

Field 304 contains the current status corresponding to each of the servers in field 
302. The current status of a server corresponds to a current ability of the server to 
5 process data. For the purposes of this discussion, it will be assumed that the current 
status of a server may be one of good, fair, or poor. As shown in Fig. 3a, the current 
status of all of the terminals is "good". As described above, the current status of a server 
may also be described numerically, such as by a percentage. 

Field 306 contains information related to an overload status of a server 108-1 12. 
10 As described above, a server is overloaded when the amount of processing required to 
process incoming data transmissions exceeds the processing capacity of the server 
108-112. 

Field 308 contains a load coefficient Q corresponding to each of the servers. The 
load coefficients of each of the servers may be the probability of the terminals 102-106 

1 5 distributing a data transmission to one of the servers 108-112 rather than blocking the 
data transmission. Because the load coefficient Q represents a probability, acceptable 
values for the load coefficient ranges from 0 to 1. Each terminal 102-106 can block a 
data transmission according to the value of Q by blocking (1-Q)*L number of data 
transmissions out of a total L number of data transmissions. 

20 Field 310 contains a transmission rate corresponding to each of the servers. As 

described above, the transmission rate can define a portion of the total data transmissions 
that is to be received by each of the servers 108-1 12. Alternatively, the transmission rate 
can be a portion of a particular terminal's 102-106 data transmissions which are sent to a 
particular server 108-112. 

25 Examples of operation of the throttling unit 114 will now be described with 

reference to the data structures 300 shown in Figs. 3a-3d. Referring first to the Fig. 3a, 
all of the servers 108-1 12 are shown to have a "good" current status and none of the 
servers are overloaded. Accordingly, the controller 200 has set the load coefficient of 
each of the servers 108-1 12 to be one (1) and the transmission rate of each of the servers 

30 108-1 12 is set to be one-third (1/3). As described above, this means that one-third (1/3) 



Docket No.: 1999-0482 7 

of all the data transmissions of the terminals 102-106 is transmitted to each of the servers 
108-112. 

Referring to Fig. 3b, field 304 shows that the server having server ID 108 has a 
"fair" current status, while the servers having server IDs 1 10 and 1 12 both have a "good" 
5 current status. Field 306 shows that none of the servers are currently overloaded. 
Accordingly, the load coefficient for each of the servers is equal to one. 

Assume for this example that the server having server ID 108 has a current status 
that is equal to "fair" because the server is temporarily being repaired. Accordingly, the 
server is not operating at foil capacity. For instance, the server 108 may only be 

10 operating at 50% of full capacity. Accordingly, the controller 200 may calculate the 

transmission rates as shown in field 310 to take into account the current status of each of 
the servers 108-112. 

For example, the transmission rate corresponding to the server having server ID 
108 is one-ninth (1/9) while the transmission rates of the servers having server IDs 1 10 

15 and 1 12 is four-ninths (4/9). The difference in transmission rates between server 108 and 
servers 110-112 reflects the differing current status of each of the servers. For example, 
only one-ninth (1/9) of the data transmission being sent from terminals 102-106 are now 
sent to server ID 108 while eight-ninths (8/9) of the transmission are divided equally 
between servers 110 and 1 12. 

20 Referring now to Fig. 3 c, the server having server ID 108 has been determined by 

the controller 200 to have a current status of "poor". The current status of server 1 10 is 
"good", while the current status of server 1 12 is "fair". Furthermore, as can be seen from 
field 306, the server having server ID 108 is currently overloaded, while the servers 110 
and 112 are not. 

25 Based on the current status and overloaded status of each of the servers 108-1 12, 

the controller has generated a set of transmission rates which are shown in field 310. The 
transmission rate for the overloaded server, server 108, is zero. This means that none of 
the terminals 102-106 are currently sending data transmissions to the server 108. In 
contrast, the servers 110 and 112 now have an increased transmission rate. Furthermore, 

30 the server 110 which has a current status of "good" must now take on more of the 

transmission from the terminals 102-106 than the server having ID 1 12 and a current 
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status of "fair". Accordingly, the transmission rate of server 1 10 is three-fourths (3/4) 
while the transmission rate of server 1 12 is only one-fourth (1/4). 

Referring now to Fig. 3d, the current status of server 108 and server 1 12 is "fair", 
while the current status of server 1 10 is "good". However, in Fig. 3d, the field 306 
5 indicates that all of the servers 108-1 12 are overloaded. 

In response, the controller 200 may decide to limit the output of each of the 
terminals 102-106 by reducing the load coefficient of a terminal transmitting to each of 
the servers 108-1 12. As shown in field 308, the load coefficient corresponding to each of 
the servers 108-1 12 is now set to 0.5. Because of this, when a terminal 102-106 attempts 
1 0 to transmit to any of the servers 1 08-1 12, the server will be restricted to transmitting only 
one-half of the total data transmissions which it may wish to transmit. The load 
coefficient of 0.5 may indicate that half of the data transmissions are transmitted to the 
servers 108-1 12 while half of the data transmissions are discarded by the terminals 
102-108. 

1 5 Furthermore, field 3 1 0 of Fig. 3d indicates that the transmission rate of the server 

108 and server 1 12 is set to one-ninth while the transmission rate of the server 1 10 is set 
to seven-ninths. As described above, the transmission rates are set to reflect the current 
status of each of the terminals 102-106 as indicated in field 304. Accordingly, one-ninth 
of the total transmissions from the terminals 102-106 are being sent to the server 108, 

20 while at the same time, one-half of the transmissions from the terminals 102-106 which 
are sent to the servers 108-1 12 are actually being transmitted. Accordingly, the loads on 
all of the servers are reduced in proportion to their current status. The overload status, as 
shown in field 306, can be removed once the servers 108-1 12 are no longer overloaded. 
Figure 3e-3j are alternative exemplary data structures 350 of the memory 202. 

25 The data structure 350 includes a server ID field 352 that is similar to the server ID field 
302 of the data structure 300. The data structure 350 also includes an overloaded field 
354, a load coefficient field 356, and a transmission rate field 358 which are similar to the 
overloaded field 306, the load coefficient field 308, and the transmission rate field 310, 
respectively, of data structure 300. 

30 An alternative example of operation of the throttling unit 1 14 will now be 

described with reference to data structure 350 shown in Figures 3e-3j. Referring first to 



Docket No.: 1999-0482 9 

Figure 3e, none of the servers are shown to be overloaded. Accordingly, the controller 
200 has set the load coefficient of each of the servers 108-1 12 to be one (1), and the 
transmission rate for each of the servers 108-1 12 is set to be one third (1/3). As described 
above, this means that one third (1/3) of the data transmission of the terminals 102-106 is 
5 transmitted to each of the servers 108-1 12. 

Referring to Figure 3f, when a total load from the terminals 102-106 increases and 
causes the servers 108-1 12 to become overloaded as shown in field 354, the controller 
200 may reduce the load coefficient corresponding to each of the servers 108-1 12 by a 
predetermined percentage, here the percentage is reduced by 5 percent. Therefore, as 
10 shown in field 356, the load coefficients corresponding to each of the servers 108-1 12 is 
now set to 0.95. The transmission rates for each of the servers 108-1 12 remains 
unchanged. 

As shown in Figure 3f, the servers 108-1 12 still remain overloaded even after a 
reduction of in the load coefficient of 5 percent. Accordingly, the controller 200 again 

1 5 reduces the load coefficient by 5 percent. The new value of the load coefficient as shown 
in field 356 of Figure 3g is now 0.9. Again, the transmission rates for each of the servers 
108-1 12 remains unchanged. In response to the reduction, the overloaded status in field 
354 of each of the servers 108-1 12 now shows that none of the servers are overloaded. 
Accordingly, as shown in Figure 3h, the load coefficient for each of the servers 

20 108-1 12 may be increased by 5 percent. As shown in field 356, the load coefficient for 
each of the servers 108-1 12 is now 0.95. Again, the transmission rate in field 358 for 
each of the servers 108-1 12 remains unchanged at one third (1/3). . 

It is important to note that the load coefficient in column 356 may continue to 
oscillate between point 0.9 and 0.95 depending on the total load from the terminals 102- 

25 106. Alternatively, the load coefficient may return to one (1) when the load from the 

terminals 102-106 decreases to a point where the servers 108-1 12 can collectively handle 
the total load. 

An alternative example of the throttling unit 1 14 will now be described with 
reference to data structure 350 shown in Figures 3i and 3j. Referring first to Figure 3i, 
30 assume that servers 110 and 1 12 are running at full capacity, while the server 

corresponding to server ID 108 is experiencing a partial failure. In the initial state, the 
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servers 108-1 12 are all exhibiting a non-overload condition, having a load coefficient of 
one (1) and a transmission rate of one third (1/3). 

Referring now to Figure 3j, as the controller 200 becomes aware that the server 
108 is experiencing a partial failure and is now overloaded, the controller 200 will reduce 
5 the transmission rate of data sent to the terminal 108 by a predetermined amount; in this 
example, the predetermined amount is 5 percent. Accordingly, the transmission rate 
corresponding to server 108 in field 358 is reduced by 5 percent while the transmission 
rate of the terminals 1 10, 1 12 are each increased by one half of 5 percent (2.5 percent) in 
order to make up for the servers 108 ? s decreased capacity. As described above in the 

10 above example, the process may continue until the server 108 is no longer overloaded. 

Fig. 4 is a flowchart outlining an exemplary operation of a throttling unit to 
determine whether a server is overloaded. Initiation of the process begins in step 400. In 
step 410, the server may waits for a predetermined period of time, such as 10 seconds. In 
step 420, the server determines if it is overloaded. If the server is not overloaded, or is 

15 normally loaded, the process returns to step 410. If the server is overloaded, the server 

advances to step 430 where the server sends an overload notification to the throttling unit. 
After the server sends the overload notification, the server returns to step 410. 

Fig. 5 is a flowchart outlining an exemplary operation of a terminal to determine 
whether a server is overloaded. Initiation of the process begins in step 500 and proceeds 

20 to step 510. In step 5 10, the terminal waits for an overload notification from a server for 
T seconds. This period of time may be set to, for example, 15 seconds. In step 520, the 
terminal verifies if notification for a particular server was received. If notification was 
received, the terminal advances to step 530; otherwise, if notification was not received, 
the terminal advances to step 540. 

25 In step 530, the terminal updates an overload status of the particular server to 

"overloaded." In step 540, the terminal updates an overload status of the server to 
"normal." For example, the terminal may maintain an overload status of a server if the 
client receives an overload notification. Additionally, the terminal maintains a normally 
loaded status of the server if the terminal has not received an overload notification for a 

30 period of time such as, for example, T seconds. In this manner, the terminal can maintain 
a status of all of the servers. 
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In step 550, the terminal determines if all of the servers are done reporting. If all 
of the servers are done, the terminal advances to step 510. If all of the servers are not 
done reporting, the terminal advances to step 560. In step 560, the terminal advances to a 
next server and then returns to step 520. 
5 As described above, the throttling unit 114 can also maintain a local load 

coefficient Q. The load coefficient Q is the probability of the terminals 102-106 
distributing a data transmission to one of the servers 108-1 12 rather than blocking the 
data transmission. Because the load coefficient Q represents a probability, acceptable 
values for the load coefficient range from zero to one. Each terminal 102-106 can block 
10 data transmissions according to a value of Q by blocking, for example, (1-Q)*L number 
of data transmissions out of L transmissions where L represents a number of data 
transmissions. 

As also described above, the terminals 102-108 may additionally maintain a 
number of transmission rates w(j), where w(j) is the weight of the j-th server. 

1 5 Transmission rates w(j) is the probability of a client distributing a non-blocked data 
transmission to a j-th server. Each terminal 102-106 may distribute new non-blocked 
data transmission among the servers 108-1 12 according to transmission rate values by 
distributing to a j-th server, for example, w(j)*Ll number of data transmissions out of a 
total of LI non-blocked data transmissions. The sum of all w(j) transmission rates will be 

20 equal to 1 . During a second period of time, for example R seconds (R>T), each terminal 
102-106 can update its local load coefficients and all weights. It may not be required that 
different terminals 102-106 perform the upgrades simultaneously. The upgrade rules can 
be performed according to the following flowcharts. 

Fig. 6 is a flowchart outlining another exemplary operation of a terminal or 

25 throttling unit. This flowchart illustrates the periodic updating of the local load 

coefficients Q and the transmission rates w(j) by a server. Initiation of the process begins 
in step 600 and proceeds to step 610. 

In step 610, the terminal waits for a period of time R. The period may be set to, 
for example, 15 seconds. In step 620, the terminal determines if all of the servers are 

30 overloaded. The overloaded determination can be based on a local load status of the 
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servers. If all of the servers are not overloaded, the terminal advances to step 640. If all 
of the servers are overloaded, the terminal advances to step 630. 

In step 630, the terminal decreases a local load coefficient. This decrease in the 
local load coefficient may increase the probability of the terminal blocking data 
transmission. For example, the local load coefficient Q may be decreased by a value of 
0.05. The process then advances to step 610. 

Alternatively, in step 640, the terminal may determine if some of the servers are 
overloaded. This determination can be based on a local load status of the servers. If none 
of the servers are overloaded, the process advances to step 660; otherwise, if some of the 
servers are overloaded, the process advances to step 650. In step 650, the terminal can 
modify the local transmission rate w(j) of the servers. The terminal can modify the local 
transmission rate w(j) to move some of the load from the overloaded servers to the 
normally loaded servers. For example, the terminal can decrease the transmission rate 
w(j) of the overloaded servers by some percentage, for example, 5%. The terminal may 
additionally increase the transmission rate w(j) of the normally loaded servers by the 
different percentage, for example, P%, so that the sum of the weights for all the servers 
after the update still stays equal to one. P may be calculated as 0.05 multiplied by sum of 
all weights for normally loaded servers estimated prior to weight increase and, then 
divided by the sum of all weights for overloaded servers estimated prior to their decrease. 
The process then advances to step 610. 

In step 660, the terminal determines whether any of the servers is overloaded. 
This determination may be based on the local load status of the servers. If at least one of 
the servers is overloaded, the process returns to step 610. If none of the servers are 
overloaded, the process advances to step 670. In step 670, the terminal increases the local 
load coefficient Q. Increasing the local load coefficient increases the total server load 
without redistributing the load. This increase can result in a decrease in the probability of 
the terminals blocking or discarding requests. For example, the terminal may increase the 
local load coefficient Q by some value, for example, 0.05. The local load coefficient may 
be increased up to a value of one. The process then returns to step 610. 

Fig. 7 is a flowchart outlining another exemplary method for throttling data 
transmission. This flowchart illustrates the generating and distributing of new data 
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transmissions by the terminals. Initiation of the process begins in step 700 and proceeds 
to step 710. In step 710, the terminal generates a new data transmission. 

In step 720, the terminal determines whether to block the transmission. The 
transmission can be blocked based on a current value of the local load coefficient Q. For 
5 example, the determination of blocking the data transmission may be determined 

according to a probability blocking the transmission of (1-Q). If the client determines not 
to block the data transmission, the process advances to step 740. The terminal may send 
the data transmission to one of the servers 108-1 12 based on the current values of the 
local transmission rates w(j). For example, the transmission may be sent to the server j 

1 0 with probability w(j). If the terminal determines to block the data transmission, the 

process advances to step 730. In step 730, the terminal blocks the data transmission and 
advances to step 750 where the process ends. 

The method of this invention is preferably implemented on a programmed 
processor. However, throttling unit 114 may also be implemented on a general purpose 

15 or special purpose computer, a programmed microprocessor or microcontroller and 

peripheral integrated circuit elements, an ASIC or other integrated circuit, a hardware 
electronic or logic circuit such as a discrete element circuit, a programmable logic device 
such as a PLD, PLA, FPGA or PAL, or the like. In general, any device on which resides 
a finite state machine capable of implementing the flowcharts shown in Figs. 4-7 may be 

20 used to implement the processor functions of this invention. 

While this invention has been described with specific embodiments thereof, it is 
evident that many alternatives, modifications, and variations will be apparent to those 
skilled in the art. Accordingly, the preferred embodiments of the invention as set forth 
herein are intended to be illustrative, not limiting. Various changes may be made without 

25 departing from the spirit and scope of the invention. 
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WHAT IS CLAIMED IS : 

1 LA method of controlling data transmissions in a network between at least one 

2 terminal and at least one server, comprising: 

3 determining a current status of the at least one server; 

4 determining a transmission rate of the at least one terminal based on the 

5 current status of the at least one server; and 

6 adjusting the transmissions from the at least one terminal to the at least one 

7 server based on the transmission rate. 

1 2. The method of claim 1, wherein the step of determining the current status of 

2 the at least one server comprises receiving an overload notification from one of at least 

3 one server and updating a local status indicator for the one of at least one server. 

1 3. The method of claim 1, wherein the step of adjusting the transmission further 

2 comprises modifying at least one local load weight to move a load from at least one 

3 overloaded server to at least one non-overloaded server. 

1 4. The method of claim 1, wherein the step of determining the current status 

2 comprises determining an overload status of each server based on whether any server is 

3 overloaded. 

1 5 . The method of claim 4, wherein the step of adjusting the transmission further 

2 comprises adjusting a local load coefficient based on the overload status. 

1 6. The method of claim 5, wherein the step of adjusting the local load 

2 coefficient includes decreasing the local load coefficient if the overload status indicates 

3 that all of the servers are overloaded. 

1 7. The method of claim 5, wherein the step of adjusting the local load 

2 coefficient comprises increasing the local load coefficient if the overload status indicates 

3 that none of the servers are overloaded. 

1 8. The method of claim 1, wherein the step of adjusting the transmission further 

2 comprises modifying at least one local load weight if a portion of the at least one server is 

3 overloaded. 

1 9. An apparatus that controls data transmissions on a network between at least 

2 one terminal and at least one server, comprising: 
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3 a memory; 

4 a network interface; and 

5 a controller, connected to the memory and the interface said controller: 

6 1) determining a current status of the at least one server, 2) determining a transmission 

7 rate of the at least one terminal based on the current status of at least one server, and 

8 3) adjusting the transmissions of the at least one terminal to the at least one server based 

9 on the transmission rate. 

1 10. The apparatus of claim 9, wherein the controller determines the current status 

2 of the at least one server by receiving an overload notification from one of at least one 

3 server and updating a local status of the one of at least one server. 

1 11. The apparatus of claim 9, wherein the controller adjusts the transmission 

2 further comprises by modifying at least one local load weight to move a load from at least 

3 one overloaded server to at least one non-overloaded server. 

1 12. The apparatus of claim 9, wherein the controller determines the current status 

2 comprises determining an overload status of each server based on whether any or all of 

3 the servers are overloaded. 

1 13. The apparatus of claim 12, wherein the controller adjusts the transmission by 

2 adjusting a local load coefficient for each server based on the overload status. 

1 14. The apparatus of claim 13, wherein the controller adjusts the local load 

2 coefficient by decreasing the local load coefficient if the overload status indicates that all 

3 of the servers are overloaded. 

1 15. The apparatus of claim 14, wherein the controller adjusts the local load 

2 coefficient by increasing the local load coefficient if the overload status indicates that 

3 none of the servers are overloaded. 

1 16. The apparatus of claim 9, wherein the controller adjusts the transmission by 

2 modifying at least one local load weight if a portion of the at least one server is 

3 overloaded. 
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ABSTRACT OF THE DISCLOSURE 
The present invention provides a throttling unit for throttling and distributing data 
transmissions between terminals and servers in a network. The throttling unit determines 
a current status of the servers in the system, typically by receiving an overload 
5 notification from each server. Once the throttling unit has determined the current status 
of the servers, the throttling unit can adjust transmission rates of terminals to control the 
amount of the data received by each server based on its corresponding status. The 
transmission rate can also include a local load coefficient which can be adjusted to either 
increase or decrease the transmission rate of a terminal to a server depending on the status 
10 of the servers. A data transmission can subsequently be sent from a terminal to the 
servers based on the transmission rate determined for each of the servers. 
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IN THE UNITED STATES 
PATENT AND TRADEMARK OFFICE 

Declaration and Power of Attorney 

As a below named inventor, I hereby declare that: 

My residence, post office address and citizenship are as stated below next to my 

name. 

I believe I am an original, first and sole inventor of the subject matter which is 
claimed and for which a patent is sought on the invention entitled A Method And 
Apparatus For Throttling And Distributing Data Transmissions Across A Network, 
the specification of which is attached hereto. 

I hereby state that I have reviewed and understand the contents of the above 
identified specification, including the claims, as amended by an amendment, if any, 
specifically referred to in this oath or declaration. 

I acknowledge the duty to disclose all information known to me which is material 
to patentability as defined in Title 37, Code of Federal Regulations, 1.56. 

I hereby claim foreign priority benefits under Title 35, United States Code, 119 
(a-d) or 365(a-b) of any PCT or foreign application(s) for patent or inventors' certificate 
listed below or priority benefits under 119(e) of any United States provisional 
applications) listed below and have also identified below any foreign application for 
patent or inventors' certificate having a filing date before that of the application on which 
priority is claimed: 

None 

I hereby claim the benefit under Title 35, United States Code, 120 of any United 
States application(s) listed below and, insofar as the subject matter of each of the claims 
of this application is not disclosed in the prior United States application in the manner 
provided by the first paragraph of Title 35, United States Code, 1 12, we acknowledge the 
duty to disclose all information known to us to be material to patentability as defined in 
Title 37, Code of Federal Regulations, 1.56 which became available between the filing 
date of the prior application and the national or PCT international filing date of this 
application: 



None 
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I hereby declare that all statements made herein of my own knowledge are true 
and that all statements made on information and belief are believed to be true; and further 
that these statements were made with the knowledge that willful false statements and the 
like so made are punishable by fine or imprisonment, or both, under Section 1001 of Title 
18 of the United States Code and that such willful false statements may jeopardize the 
validity of the application or any patent issued thereon. 

I hereby appoint the following attorney(s) with full power of substitution and 
revocation, to prosecute said application, to make alterations and amendments therein, to 
receive the patent, and to transact all business in the Patent and Trademark Office 
connected therewith: 



Samuel H. Dworetsky (Reg. No. 27873) 

Thomas A. Restaino (Reg. No. 33444) 

Cedric G. DeLaCruz (Reg. No. 36498) 

Michele L. Conover (Reg. No.34962) 

Rohini K. Garg (Reg. No. 45272) 

Robert B. Levy (Reg. No. 28234) 

Benjamin S. Lee (Reg. No. 42787) 

Susan E. McHale (Reg. No. 35948) 

Alfred G. Steinmetz (Reg. No. 22971) 



I also appoint James A. Oliff (Reg. No. 27075), William P. Berridge (Reg. No. 
30024), Kirk M. Hudson (Reg. No. 27562), Thomas J. Pardini (Reg. No. 30411), Paul 
Tsou (Reg. No. 37956) and John S. Kern (Reg. No. 42719) of Oliff & Berridge as 
associate attorneys, with full power to prosecute said application, to make alterations and 
amendments therein, and to transact all business in the Patent and Trademark Office 
connected therewith. 

Please address all correspondence to Mr. S. H. Dworetsky, AT&T Corp., P.O. 
Box 41 10, Middletown, New Jersey 07748. Telephone calls should be made to Robert B. 
Levy at 908-221-5714. 
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Citizenship: United States of America 

Post Office Address: 29 Georgian Bay Drive 

Morganville, New Jersey 0775 1 



